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Abstract 

Online users nowadays are facing serious information overload problem. In recent years, recommender systems have been 
widely studied to help people find relevant information. Adaptive social recommendation is one of these systems in which 
the connections in the online social networks are optimized for the information propagation so that users can receive 
interesting news or stories from their leaders. Validation of such adaptive social recommendation methods in the literature 
assumes uniform distribution of users' activity frequency. In this paper, our empirical analysis shows that the distribution of 
online users' activity is actually heterogenous. Accordingly, we propose a more realistic multi-agent model in which users' 
activity frequency are drawn from a power-law distribution. We find that previous social recommendation methods lead to 
serious delay of information propagation since many users are connected to inactive leaders. To solve this problem, we 
design a new similarity measure which takes into account users' activity frequencies. With this similarity measure, the 
average delay is significantly shortened and the recommendation accuracy is largely improved. 
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Introduction 

The information and communication technologies lead us to an 
information-rich era where recommender systems are widely used 
to filter out irrelevant information [1-3]. Recommendation 
algorithms include correlation-based collaborative filtering [4—6], 
Bayesian clustering [7], probabilistic latent semantic analysis [8], 
matrix decomposition [9, 10] . Many issues related to recommender 
systems have been studied, such as the diversity of recommenda- 
tions [11,12], the effect of network topology [13], and ground user 
[14]. Recent researches show that social influence [15] is more 
powerful than the purely mathematical analysis based recommen- 
dation, as people are more likely to accept the recommendations 
coming from their friends or peers [16]. Hence, a new technology 
named social recommendation has emerged [17-19] in which 
users (followers) can select some other users as information sources 
(leaders) and the information wiU automatically flow from leaders 
to followers. This framework has been successfully applied in many 
real online websites, such as delicious.com, tuiitter.com and digg.com. 
The information can refer to news, movies, books, bookmarks, and 
so on. Without losing any generality, news is used as an example in 
this paper. When a piece of news is submitted or approved by a 
user, it will be forwarded to her followers. The diffusion of news 
thus depends on the structure of leader-foUower network, with 
higher transmission probability of news if users with higher similar 
tastes are linked [20]. 

Recently, an adaptive news recommendation model is proposed 
[21]. In this model, when a user reads news, she can either 



"approve" or "disapprove" it. If approved, the news will be 
forwarded to her followers. With the spreading of news, the leader- 
follower network will be updated, that is, the least suitable leader 
of a user wiU be replaced by a better one according to the quality 
of the leader. The quality of her leader is measured by the 
similarity based on their past assessments on news. The model has 
been extensively tested by additional aspects like users' reputation 
[22], implicit ratings [23], local topology optimization [24], 
leadership structure [25] and link reciprocity [26] . More recently, 
Cimini et al. [27] introduced two settings for modeling users' tastes 
and showed that the heterogeneous setting of users' tastes was 
closer to the real case than homogeneous setting. 

Confirmed by many empirical analysis, it is now well-known 
that the activity frequency of online users are heterogenous [28- 
30]. However, in the original adaptive news recommendation 
model and the following studies, users are randomly selected to be 
active (i.e. submitting or reading news), which indicates that the 
activity frequency of users are set to be homogenous. In this paper, 
we find that the propagation of news is seriously delayed when 
some classic similarity metrics are applied in the heterogenous 
users activity setting. Moreover, the recommendation accuracy 
(i.e. approval fraction of the news) is lowered as well. To solve this 
problem, we propose a new similarity measure which takes user 
activity into account. The simulation shows that the propagation 
delay is considerably shortened and recommendation accuracy is 
largely improved. Finally, we introduce a more general similarity 
definition in which the weight on users' news assessments and 
users' activity is tunable. With this, the effectiveness (accuracy) and 
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efficiency (time delay) of the information propagation is further 
improved. 

Materials and Models 

Empirical analysis 

To begin our analysis, we study the distribution of user activity 
frequency in real systems. Here, we consider the dataset of 
digg.com [31], which contains 3,018,197 votes on 3,553 popular 
stories made by 139,409 distinct users over a period of a month in 
2009. In this dataset, users and stories form a bipartite network in 
which a link between a user / and a story a. exists if user i reads 
story a. The degree of a user represents the number of stories read 
by her (i.e. the activity of this user). Figure 1 shows the degree 
distribution in three observation time windows. Clearly, the 
distribution follows a power-law with exponent around —2. The 
results confirm that users are very heterogenous in the frequency 
of their online activity. 

Furthermore, we employ the Kendall's tau coefficient (t) to 
calculate users' correlation of activity frequency in two adjacent 
periods. The length of the period is set as one day, so we are 
actually calculating the Kendall's tau coefficient of users' activity 
frequency in each day and the previous day. As shown in Fig. 2, T 
is always larger than zero, which means that users' activity 
frequency is positively correlated in time. Moreover, we observe 
that there is some periodic fluctuation in Fig. 2. With the actual 
dates, we check carefully the reason for this periodic fluctuation. 
We find that each period is one week, and the correlation is higher 
in weekdays than in weekends. We conjecture it is because 
people's live is regular in weekdays but diverse in weekends. 

Model description 

In the original adaptive news recommendation model [21] and 
the following studies [22-27], users' activity frequencies are 
assumed to be homogenous, which is inconsistent with the results 
of above empirical study. To make the information propagation 
model closer to the real system, we introduce the heterogeneity of 
users' activity frequency to it. Our model will be directiy built on 
the original news-sharing model in ref. [21]. The system consists of 
IJ users. Each of them is connected by directed links to L other 
users, who represent her news sources and to whom we refer as her 
leaders. The value of L is fixed as users can follow a limited number 
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Figure 1. The distribution of activity frequency of users on 
Digg.com over a period of a month in 2009. 

doi:10.1371/journal.pone.0096614.g001 
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Figure 2. The correlation of the activity frequency of 
users in two adjacent days. The actually dates are 
marked in the horizontal axis. ^>a- in the legend means that we only 
take into account the users with total activity being larger than x. 
doi:10.1371/journal.pone.0096614.g002 

of sources. Users receive pieces of news from their leaders, and 
eventually assess them. In addition, they can introduce new 
content to the system. 

Evaluation of news a by user i (e,a) is either + 1 (liked), — 1 
(disliked) or 0 (not read yet). The set of evaluations from any pair 
of users / and j is the basis to compute their similarity of their 
interests (or reading tastes), which is denoted as .?//. The explicit 
recipes to compute users' similarity are presented in the next 
section. Note that, apart from their evaluations, no other 
information about users is assumed by the model. 

Users' activity. In each time step of the simulation, a given 
user is active with probability Pa- When active, a user reads the top 
R news from her recommendation list, immediately forwarding 
the ones she likes to her followers. In addition, with probability ps 
she submits a new piece of news. Different from the original model 
in [21], the users' activity frequency is drawn from a power-law 
distribution as P{Pa)~P a where y = 2. 

Propagation of news. When news a is introduced to the 
system by user / at time /«, it is forwarded from / to the users j who 
have selected her as a leader, with a recommendation score 
proportional to their similarity J//. If this news is later liked by 
one of her followers j, it is similarly passed further to this user's 
followers q, with recommendation score proportional to Sjq, and so 
on. For a generic user q at time t, a news a is recommended to her 
according to its current score: 



(1) 



where is the set of leaders of user q. b\s, 2l Dirac delta function 
with only two possible values: 0 and 1. If user q has not read news 



a, be^j^f) = 1 since e, 
eq'i 7^ 0. Similarly, 



0 and if q has read news a. 
,1 = 1 if user / likes news a, otherwise 



&eqa,fi = 0 slnCC 



- 0. To make the fresh news fast accessed, recommendation 



scores are damped with time [1 6 (0,1] is the damping factor). 

Leader selection. The model is adaptive. Initially, each user 
randomly select L other users as her leaders. Leader-follower 
connections are periodically rewired to make the social network 
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approach an optimal state where only highly similar users are 
connected [32]. In each rewiring, for user /, her current leader j 
with the lowest similarity value is replaced with a new user [q] if 
Siq>Sij. There are different selection strategies for picking new 
candidate leaders, which are discussed in detail in [22,24,26]. In 
this paper we employ a hybrid strategy in which the user q is 
picked at random in the network with probability 0.1, otherwise 
she is selected among the leaders' leaders and followers of user i to 
maximize Siq. This mechanism well mimics users establishing 
mutual friendship relations, searching for friends among friends of 
friends, and having casual encounters which may lead to long- 
term relationships. In addition, it is an excellent compromise 
between computational cost and system's performance [24]. 

Measure of users' similarity 

An essential ingredient of the social recommendation algorithms 
is the estimated similarity of users' reading tastes, which regulates 
the news' flow over the system by determining the leaders' 
selection from users (i.e., the link structure of the network) and 
recommendation scores of news. Since only users' ratings and 
records of activities are known, the similarity of a pair of users has 
to be estimated from their assessments on news, which in our case 
can be either approved, disapproved, or not rated. 

The first similarity measure considered is introduced in [21] as 



(0)_ iAn-4,-i+iAnA-i 



(2) 



where Ai(Aj) is the set of news approved by user i (user j), Di(Dj) 
is the set of news disapproved by user / (user j), and Ni(Nj) is the 
set of news read by user / (user j). The term 1 — 1 /^Nif]Nj is used 

to remove the effect of statistical fluctuation. If a user / and a user j 
share a small number of commonly read news, they are more likely 
to achieve "perfect" similarity 1. After multiplying this term, the 
similarity measure wiU give this user pair a very low similarity 
value. In sampling of « trials, the typical relative fluctuation is of 
the order of l/\/n. Therefore, we select the above form. 

In [33], it is shown that Eq. (2) works well only in the system 
where tastes of users are homogeneously distributed, i.e., each user 
has the same number of interested fields. To achieve a more 
accurate leader assignment in the system where users can have 
different number of tastes, an asymmetric similarity measure is 
defined as 



\Aif]Ai\ 
\Aj\ 



(3) 



1 — 1 / \/\Aj\ here is also used to remove the effect of statistical 
fluctuation. 

In this paper, we consider the systems where users' activity 
distribution is uneven. Some users can be extremely active and 
read many news, so that their followers can constandy receive 
fresh news. On the other hand, if one user is connected to many 
inactive leaders, the news received by her wiU be very limited. 
Therefore, recommending highly inactive users is meaningless. 
Considering this, we modified the similarity in Eq. (3) as 



are many other previous works showing that online users' activity 
frequency is unevenly distributed[34,35]. 

Users' active frequencies Pa are users' inherent feature and 
unknown by the recommender system. We design the following 
way to estimate Pa of the users. Instead of taking the whole history 
into account, we only use the recent record of activities within a 
time window with length T (In our simulation, 7" = 250 

generally works best, see Supporting Information SI). The 
estimated probability for user j to get online is 



Pjit)-- 



fM') 



(5) 



where f/iw) is user fs online times from time U' to time t—\. If 

fjiw) equals 0 for each w, we set Pjit) as ^^^^ In Eq. (5), 

users' recent record plays a more important role. This is very 
useful in real systems, since the correlation of real users' activity 
frequency is generally high in short term (See Fig. 2). 

However, Pj(t) cannot be directly used as in Eq. (4). Since 
Pi(t)~ P(pa), Pj{t) follows a power-law distribution. If it is used as 
Hjit), some users with high activity will dominate the similarity 
matrix and be always selected as the leaders of others. In order to 
solve this problem, we proposed a logarithmic way to embed Pj{t) 
in Hj(t). After normalization, it reads 



Hi(t)-- 



^(InP,(0-Inxo) 
In xa 



1 



(6) 



-. After 



where A'q is the possible lowest value of Pj( t), set as 

simplification, Hj(t) = 1 + logj-j 
can distinguish different users and the most inactive users are 
punished severely. However, the majority gets Hj{t) over 0.5, as 



T(T+\)' 
T+\) Pji^)- 111 this definition, Hj(t) 



(2) 

shown in Fig. 3. We rewrite Sy as 
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1+log 



nr+i) 



(7) 
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where Hj(f) is a measurement of user /s activity. Actually, there 



Figure 3. The relation between Pj and Hj. 

doi:1 0.1 371/journal.pone.009661 4.g003 
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Table 1. List of parameters used in simulations. 



parameter 


symbol 


value 


Number of users 


U 


3498 


Number of leaders per user 


L 


10 


Dimension of taste vectors 


D 


20 


Minimum active elements per vector 


D,„i„ 


4 


Maximum active elements per vector 


D,„„ 


8 


Users' approval threshold 


A 


3 


Index of distribution 


V 


-2 


Probability of submitting a news 


Ps 




Number of news read when active 


R 


3 


Damping of recommendation score 


I 


0.9 


Base similarity for users 


■So 


10-' 


Period of the rewiring 


M 


10 



doi:l 0.1 371/journal.pone.009661 4.t001 



In oiu' simulation, we actually compare our method to with 
some start-of-the-art similarity methods based on both news- 
reading and topology. The results show that J*^' can outperform 
others, see Supporting Information SI. In the following, we will 
study the behavior of the system under these similarity metrics. For 
numerical tests of the model, we use an agent-based framework. 

Agent-based simulations 

To model users' judgments of read news we use a vector model 
where tastes of user / are represented by a Z)-dimensional taste 
vector a, = (a\ ,...,af ) and attributes of news a are represented by a 
Z)-dimensional attribute vector boi = (bl,...,h^). Similar vector 
models are often used in semantic approaches to recommendation 
[36]. Opinion of user / about news cc is based on the overlap of the 
user's tastes and the news's attributes, which can be expressed by 
the scalar product 



(8) 



We assume that user / approves news a only when £l,c > A, 
disapproves otherwise, where A is the users' approval threshold: 
the higher it is, the more demanding the users are. Here, we adopt 
the heterogeneous setting of the taste/ attribute vectors. 

Each user has preference for a variable number of D available 
tastes. Each taste vector has a different number of elements equal 
to one (active tastes, denoted as d) and the remaining elements are 
zero. In this paper, we assume D„,i„<d <D„,ax where and 
Dmax are the minimum and maximum number of active tastes that 
users can have, respectively. Moreover, we assume that each news' 
attribute vector has a fixed number Z)^™ of active attributes 
(number of ones), which are randomly chosen among the active 
tastes of the user who submits it. 

Simulation runs in discrete time steps. Assuming no a priori 
information, the starting network configuration is given by 
randomly assigning L leaders to each user. Then in each 
simulation step, an individual user is active with probability Pa- 
When active, the user reads and evaluates the R top-recom- 
mended news she has received and with probability ps submits a 
new news. Connections are rewired every u simulation steps. 
Parameter values used in all following simulations are given in 



Table 1 . A detailed study of the effect of the parameters on the 
model can be seen in [24]. 

Metrics 

Three metrics are used to measure the performance of the 
recommendation models: approval fraction i/', average differences 8, and 
average delay (j). 

Approval fraction t/^. The ratio of news' approvals to all 
assessments is an obviously important measure of the method's 
performance. This number, referred to as approval fraction, tells 
us how often users are satisfied with the news they get 
recommended. The higher the i/' is, the more accurate the 
recommendation is, and users are more satisfied correspondingly. 
It can be defined as 



(9) 



where term S^.^^i equals one if user / approved news a and zero 
otherwise, and term <5|t..^|j equals one if user / has rated news a 
and zero otherwise. 

Average differences 5. In the computer simulation, we have the 
luxury of knowing users' taste vectors and hence we can compute 
the number of differences between the taste vector of a user and 
the taste vectors of the user's authorities. By averaging over all 
users, we obtain the average number of differences. Obviously, the 
less are the differences, the better is the assignment of authorities. 
The average differences are defined as 



EE 

i IgL; 



\\ai-di\\ 
\Li\ ' 



(10) 



where L,- is the set of leaders of user /, and a,- (a/) is the taste vector 
of user (user /). 

Average delay (j). The freshness of the news is very important. 
Once the news becomes old, it is of no interest to users. The 
average delay measures the novelty of the news read by users. A 
small average delay indicates that users are always reading fresh 
news. The average delay is defined as 
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c 
3 



01 

z 



(11) 



where A^,- is the set of news read by user i, tj^ is the time when / 
reads news a, is the submitted time of news a, and Us is set of 
users. The smaller the ^ is, the news read by users will be fresher. 

Results and Discussion 

We now study the described adaptive social recommender 
system under difiFerent definitions of the similarity measure 
employed. For comparison, initial conditions and parameters for 
all simulations are identical, as listed in Table 1 . We obtained the 
average differences, approval fraction and average delay resulting 
from each similarity definition. We first consider the case where 
the user activity and the number of user interest are uncorrelated 
and the results are shown in the uncorrelated case in Table 2. As 
expected, J*'' enjoys a higher approval fraction and a smaller 
average difference than i'"*. The results are consistent with ref 
[27,33]. However, j'"* results in a smaller average delay than '. 
Among these methods, performs the best in all three metrics. 
These results suggest that introducing the users' activity to the 
similarity measure can significantly speed up the propagation of 
news in online systems so that users mostly receive fresh news. 
Moreover, it improves the leader assignment, resulting in a more 
accurate recommendation of news for users. 

One concern of the new similarity measure is that the network 
updating algorithm might focus too strongly on the activity 
frequency of leaders rather than the taste overlap of the leader and 
follower, putting the information recommendation accuracy at 
risk. Accordingly, we further introduce a parameter r] to adjust the 
effect of users' activity in the i'^' similarity calculation: 



1.1 



1.1 



(12) 



Parameter r} controls the weight assigned to Hj(t). When t] = 0, 
Eq. (12) reduces to i*'*, and when t]=l, Eq. (12) reduces to s^^\ 

The stationary values of average delay (j) and approval fraction 
ijj obtained by using i*^' under different rj are reported in Fig. 4. 
One can immediately see that there is a maximum approval 
fraction when adjusting rj. On the other hand, the average delay 
drop dramatically once r]>0 and then decreases monotonously 
with t]. Compared to the case where the users' activity is not 
considered in the similarity calculation [rj = 0), the approve fraction 
can be improved and average delay can be considerably 
shortened. Interestingly, the optimal is around 1, corresponding 
to the case of s^^\ 

We further consider the situation where the user activity and the 
number of user interest are correlated. Three cases are considered 
in this paper: positive correlation, negative correlation, and no 
correlation. We first compare three evaluation metrics under 
difiFerent correlation settings (see Table 2). One can immediately 
see that both positive and negative correlation between the user 
activity and the number of user interests can significandy shorten 
the average delay (j) of the news. However, the delay from and 
are stiU longer than s^^\ The advantage of can also be 
found in the average difference S. A lower average difference 
indicates that the network is adapted to a better state for news 
propagation. We can also see that enjoys the highest approval 
fraction ij/ in almost all cases. When the correlation between the 
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Figure 4. Stationary values (simulation step #15000) 
of average delay ^ and approval fraction xji in the adaptive 
system ruled by sf^ , and for different values of the parameter r\. 

doi:10.1371/journal.pone.0096614.g004 



user activity and the number of user interests is positive, tlie 
approval fraction of i'"' is a bit higher than that of J*^' . However, 
the delay in i'"' in this case is more than twice longer than that of 
i*^^ Taken together, j*^' is a very effective and robust similarity 
measure for recommending leaders in online social systems. 



The leader-follower networks after the systems reach stable state 
is studied. We first investigate the in-degree distribution of nodes 
(i.e. the distribution of number of followers). The results show that 
the largest in-degree in is smaller than that in j*"' and J*'*. This 
is because users in i'^' select leaders according to not only the 
similarity but also the activity frequency. It is more difficult for the 
largest in-degree nodes in i*^' to attract as many followers as in i^"' 
since these users have both high similarity to others and high 
activity frequency. We then study some properties of the users of 
different activities in Fig. 5. Fig. 5(a) shows the relation between 
user activity and the number of her followers. As discussed above, 
if the leaders of a user are with low activity, the user may have no 
news to read and the propagation of news will be largely delayed. 
This happens a lot in the original similarity measure j'"' and ,5''' 
(See the flat curves of them in Fig. 5(a)). We didn't plot the curves 
of negative and uncorrelated cases in j'*"' and j'*'' because they are 
as flat as in the positive correlation case. In s^^\ the users with 
higher activity frequency have more followers, which makes users 
with rich number of news to read. Moreover, we observe that the 
users with higher activity and fewer interests (see the negative 
correlation case when pA is large) have more followers. In ref [33], 
it is already pointed out that the users with few interests are good 
information resource and should be selected as leaders (since they 
are specialized in their fields). As shown in Fig. 5(a), i*^' 
recommends the users with high activity and few interests as 
leaders to others. This again supports that J*^* is a good similarity 
measure. 
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Figure 5. The (a) number of followers, (b) average activity of followers, (c)number of forwarded news and (d) spreading 
range of users with different activities 
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In Fig. 5(b), we present the relation between users' activity and 
the average activity of their followers. We can see that the users 
with higher activity and wider interests (i.e. large Pa in positive 
correlation) have more active followers. Generally, the interests of 
the followers are wider than that of the leaders [33]. The followers 
of large Pa users will have wide int(;r(;sts and thus high activity. 

Moreover, it is interesting to identify which kind of users can be 
the information hubs in online social networks. We first investigate 
the number of forwarded news of different users. As shown in 
Fig. 5(c), the users with higher activity and wider interests forward 
more new.s. With wider interests, these users are more likely to 
approve the news from their leaders, which results in a large 
number of forwarded news from them. 

However, the results in Fig. 5(d) indicate that the users with high 
activity and wide interests are actually not information hubs. We 
report the spreading range when the news is originated from 
different users in Fig. 5(d). The spreading range here is defined as 
the number of users who finally read the news. As discussed above, 
the users with fewer interests are more specialized in their fields 
and their followers are more likely to approve the news from them. 
Therefore, the news originated from users with higher activity and 
fewer interests will spread wider. The results imply that the active 
and specialized users are the information hubs in online social 
networks. 

Conclusion 

In this paper, we study a new multi-agent based model for 
information propagation and recommendation on online social 
network. The original online information propagation model was 
proposed in ref [21] where users' activity frequency is assumed to 
be homogeneously distributed. Since the empirical study of the 
online news-sharing systems suggests that users' activity frequency 
distribution actually follows a power-law distribution, we introduce 
the heterogeneity to users' activity frequency distribution to the 
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